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RESEARCH  OBJECTIVE 


The  goal  of  this  project  is  to  determine  if  Biemann's  approach  to  the  sequenc¬ 
ing  of  polypeptides  by  gas  chromatography- mass  spectrometry  can  be  carried  out  on 
the  bench  top  gas  chromatograph-mass  spectrometer-data  system  (GC-MS-DS)  developed 
by  Hewlett  Packard  (HP). 


STUDIES  COMPLETED  DURING  YEAR  1  (11/1/86-10/31/87) 

Goal  1:  To  implement  PEPALG  and  LSTGEN  on  the  HP9816  microcomputer 

PEPALG  and  LSTGEN,  the  algorithms  required  for  the  automated  interpretation  of 
the  mass  spectra  of  polyamino  alcohols,  were  redesigned  into  modular  form  from 
FORTRAN  source  code  supplied  by  Klaus  Biemann  and  coded  in  PASCAL  (revision  3.1).  In 
addition,  a  large  number  of  support  procedures  were  implemented  and  integrated  into 
a  user  interface  that  makes  PEPALG  an  interactive  program  in  the  QUICKSILVER 
environment  (the  proprietary  instrument  control  and  data  acquisition  sottware 
developed  by  HP).  These  algorithms  were  then  used  to  interpret  the  mass  spectra 
obtained  subsequently  from  glucagon  and  Tumor  Necrosis  Factor  (TNF)  [see  goal  3 
below). 


Accession  Fop 

NTIS  GKA4I 
DTIC  TAB 
Unannounced 
Justlf lcatlt 

S 

□ 

>n _ _ . 

Bv  .  1 

Distribution/  i 

Availability  Code3 

Dist 

M\ 

Avail 

Spec 

and/or 

Lai 

Goal  2:  To  perform  the  studies  on  the  HP5890A  capillary  GC  and  the  HP5970B 
quadrupole  MS  necessarv  to  determine  the  required  constants  used  in 
PEPALG  and  LSTGEN  ' 

A.  Determination  of  the  GC  conditions  for  implementation  of  the  Kovats 
retention  index  (RI)  scheme. 

The  GC  conditions  that  would  allow  all  of  the  components  of  a  mixture  of 
alkanes  of  the  form  CnH2n+2  (n=12-44)  to  elute  from  the  capillary  column  during  a 
linear  temperature  program  were  determined  from  a  large  number  of  experiments 
performed  with  different  stationary  phases  and  different  temperature  programs.  The 
use  of  these  GC  conditions  make  it  possible  to  estimate  the  retention  index  of  any 
unknown  compound  by  linear  interpolation  between  the  RFs  of  hydrocarbons  that 
differ  by  at  most  four  carbon  atoms.  The  GC  conditions  are  as  follows:  HP  1  ULTRA 
capillary  column  (film  thickness  0.33  tiM,  I.D.  0.20  mM,  length  12  M);  initial 
temperature  70°C,  final  temperature  310°C,  rate  1 .5 “C/minute,  final  hold  time  5 
minutes.  The  total  analysis  time  is  165  minutes. 


B.  .Synthesis  of  model  tripeptides  for  indexing  the  LSTGEN  library 


A  series  of  nineteen  tripeptides  of  the  form  Ala- X- Ala  Where  X  is  one  of  twenty 
amino  acids  was  prepared  with  a  Biosearch  Model  9500  peptide  synthesizer;  Ala-Ala- 
Ala  was  purchased  from  Sigma.  The  purities  of  the  model  tripeptides  were  determined 
by  HPLC  on  a  C18  column  (Table  1). 
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Table  1 


HPLC  Properties  of  Ala-X-Ala 


Peptide 

Retention  Time  (min) 

%  Purity 

A  la- A  la- A  la 

7.57 

92 

Ala- Val-AIa 

4.75 

100 

Ala-Leu-Ala 

12.1 

100 

Ala-Ue-Ala 

10.0 

96 

Ala-Pro-Ala 

17.6 

88 

Ala-Phe-AIa 

34.7 

100 

Ala-Trp-Ala 

21.3 

88 

Ala-Met-Ala 

6.47 

88 

Ala-Gly-Ala 

6.91 

87 

Ala-Ser-Ala 

6.87 

91 

Ala-Thr-Ala 

8.37 

83 

Ala-Asn-Ala 

6.39 

79 

Ala-Gin-Ala 

7.78 

82 

Ala-Cys-Ala 

18.9 

100 

Ala-Tyr-Ala 

7.47 

100 

Ala-Asp-Ala 

7.06 

88 

Ala-Glu-Ala 

9.53 

86 

Ala-Lys-Ala 

8.35 

100 

Ala-Arg-Ala 

10.4 

100 

Ala-His-Ala 

8.61 

98 

C.  Implementation  of  the  Chemistry  for  Synthesizing  Poiyamino  Alcohol 
Derivatives  of  Peptides 

Standard  solutions  (4.0  pg/mL)  of  model  tripeptides  were  prepared  in  0.005  M 
acetic  acid  that  had  been  sparged  with  N%  200  Mg  (=800  nmole)  of  the  tripeptide 
were  added  to  4  mm  PYREX  test  tubes  that  had  been  heated  for  30  minutes.  The  samples 
were  immediately  placed  in  a  PICO.TAG  chamber  and  the  acetic  acid  was  removed  under 
vacuum  (60  mTorr). 


The  detailed  procedures  published  by  Biemann  were  used  to  implement  the 
following  sequence  of  chemical  reactions: 

1.  Esterification  with  diazomethane 

2.  Acylation  with  methyl  trifluoroacetate 

3.  Reduction  with  BD3/THF 

4.  Siiylation  with  pvridine/TMSDEA 

Peptides  containing  arginine  or  cysteine  undergo  an  additional  derivatization  step, 
outlined  below,  before  they  are  submitted  to  the  above  sequence  of  reactions: 

1.  Hydrazinalysis  with  hydrazine  hydrate 

2.  Carboxymethylation  with  iodoacetic  acid. 

The  major  changes  that  have  been  made  in  the  derivatization  chemistry  are:  1) 
small  reaction  vessels  (Pyrex  6x50  mm)  have  been  used;  2)  reactions  have  been  scaled 


down  so  jiL  quantities  of  reagents  are  used;  3)  the  reaction  have  been  performed  in 
the  PICO.TAG  workstation  so  that  conditions  can  be  reproduced  exactly;  4)  all 
reactions  have  been  performed  in  an  inert  atmosphere  (dry  N2);  5)  all  evaporation 
steps  were  performed  with  controlled  vacuum. 


D.  GC-MS  properties  of  the  Derivatized  Tripeptides 

The  polyamino  alcohol  derivatives  of  the  model  tripeptides  were  analyzed  in 
the  GC-MS  system  under  the  conditions  that  had  been  determined  in  our  previous  study 
of  the  hydrocarbon  standard:  transfer  line  320°C;  solvent  delay  7.0  minutes;  low 
mass  60;  high  mass  800;  scans/second  1.17.  The  computed  retention  indices  and  the  RI 
increment  for  each  amino  acid  are  contained  in  Table  2;  the  contribution  of  the 
constant  portion  of  the  tripeptides  to  the  Rl  is  615.  It  should  be  pointed  out  that 
these  RI  increments  are  similar  to  those  derived  from  the  packed  column  work  carried 
out  by  Biemann  except  that  I(Leu)  and  I(Ile)  are  different  because  of  the  resolution 
of  the  capillary  column  used  in  these  experiments;  therefore,  PEPALG  can  distinguish 
Leu  and  lie. 


Table  2 

Retention  Index  Increments  for  Amino  Acids 


Peptide 

Rl 

I(X) 

Ala-Ala 

1259 

Ala- Ala- Ala 

1569 

l(Ala)  -  310 

Ala-Val-Ala 

1684 

I(Val)  =  425 

Ala-Leu-Ala 

1759 

I(Leu)  =  500 

Ala-Ile-Ala 

1764 

l(lle)  =  505 

A  la- Pro- A  la 

1734 

I(Pro)  =  475 

Ala-Phe-Ala 

2094 

I(Phe)  =  835 

Ala-Trp-Ala 

2649 

I(Trp)  =  1390 

Ala-Met-Ala 

1999 

I(Met)  *  740 

Ala-Gly-Ala 

1589 

l(Gly)  =  330 

Ala-Ser-Ala 

1849 

KSer)  =  590 

Ala-Thr-Ala 

1824 

I(Thr)  -  565 

Ala-Asn-Ala 

1864 

l(Asn)  =•  605 

Ala-Gln-Ala 

2069 

I(Gln)  -  810 

Ala-Cys-Ala 

2209 

I(Cvs)  -  950 

Ala-Tyr-Ala 

2419 

I(Tyr)  =  1160 

Ala-Asp-Ala 

1939 

I(Asp)  »  680 

Ala-Glu-Ala 

2029 

I(Glu)  =  770 

Ala-Lys-Ala 

2089 

I(Lys)  =  830 

Ala-Arg-Ala 

1939 

I(Arg)  =  680 

Ala- His- Ala 

2374 

I(His)  =  1115 

These  parameters  were  then  entered  into  LSTGEN  which  then  generated  a  library 
of  di-  to  penta-peptides  (A-B-C-D-E)  indexed  by  the  number  rpmputed  from  the 
formula; 

Rl  (A-B-C-D-E)  »  615  +  1(A)  +  1(B)  +  1(C)  +  1(D)  +  1(E). 

The  library  was  saved  in  a  disc  file. 
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Goal  3:  To  demonstrate  that  it  is  possible  to  determine  the  sequence  of  a 

test  peptide  and  to  estimate  the  overall  sensitivity  of  this  approach 

Glucagon  is  a  peptide  of  29  amino  acids  that  was  used  by  Biemann  as  a  test 
compound  for  his  original  experiments;  the  amount  used  for  his  experiments  was 
between  400  and  1000  nmoles.  The  primary  structure  of  glucagon  with  the  amides 
converted  to  acids  (as  would  occur  under  the  conditions  of  an  acid  hydrolysis)  is  as 
follows: 


1 

His-Ser-Glu-Glv-Thr-Phe-Thr-Ser-  Asp-Tvr-Ser-Lvs-Tvr-Leu-  Asn- 
16 

Ser-Arg-Arg- Ala-Glu- Asp-Phe- Va  I -Glu-Trp- Leu -Met- A  sp-Thr 


In  order  to  test  our  derivatization  procedures  and  software,  28.7  nmoles  of  glucagon 
was  submitted  to  partial  acid  hydrolysis  (6  N  HC1  at  120  “C  for  1  hour)  in  an  inert 
atmosphere  of  dry  Np;  the  resultant  mixture  of  peptides  was  derivatized  with  the 
above  procedures  and"  1  /30th  of  the  resulting  material  (approximately  1  nmole  of 
hydrolysate)  was  analyzed  by  GC-.MS.  The  total  ion  chromatogram  (TIC),  marked  with 
the  best  matches  identified  by  PEPALG,  is  shown  in  Figures  1-3.  It  is  qlear  that  the 
primary  structure  could  be  deduced  from  this  experiment  except  for  the  Arg-Arg 
region;  since  the  hydrolysate  had  not  been  subjected  to  hydrazinolysis,  this  gap  is 
not  unexpected.  A  second  sample  of  glucagon  was  treated  as  above  except  that  the 
hydrolysis  was  subjected  to  hydrazinolysis  prior  to  the  other  derivatization 
reactions;  the  TIC,  marked  with  the  best  matches  identified  by  PEPALG.  is  shown  in 
Figure  4  (Arg  is  converted  to  Orn  in  the  hydrazinolysis  reaction).  It  is  clear  that 
the  structure  of  the  Arg-Arg  region  is  determined  by  this  experiment. 

By  examining  the  peptides  identified  in  these  two  experiments,  it  is  clear  that 
there  are  sufficient  overlaps  to  reconstruct  the  primary  structure  of  the  underlined 
portions  of  the  glucagon  molecule.  It  is  especially  interesting  to  note  that  the 
amount  of  material  used  has  been  reduced  by  at  least  a  factor  of  20  and  that  the 
primary  structure  of  the  C-terminus  of  the  molecule  was  determined:  Trp-Leu-Met-Asp 
Thr. 

Goal  4:  To  extend  this  technique  to  polypeptides  of  pharmacological  interest 
in  the  range  of  150-200  amino  acids  in  length 

TNF  is  a  polypeptide  of  155  amino  acids  that  is  secreted  by  macrophages  and  lias 
cytotoxic  activity.  The  primary  structure  of  TNF  with  the  amides  converted  to  acids 
is  as  follows: 

1 

Ser-Ser-Ser-Arg-Thr-Pro-Ser-Asp-Lvs-Pro-Val-  Ala-His-Val-Val-Ala-Asp-Pro-Glu- 
20 

Ala-Glu-Glv-Glu-Leu-Glu-Trp-Leu-Asp-Arg-Arg-Ala-AsD- Ala-Leu-Leu-Ala-Asp-Glv- 
39 

Val-Glu-Leu-Arg-Asp-Asp-Glu-Leu- Val- Val-Pro-Ser-Glu-Glv-Leu-Tvr-Leu-Ile-Tvr- 
58 

Ser-Glu-Val-Leu-Phe-Lvs-Glv-Glu-Glv-Cvs-Pro-Ser-Thr-His- Val-Leu-I.eu-Thr-His- 
77 

Thr-Ile-Ser-Arg-Ile-Ala-Val-Ser-Tvr-Glu-Thr-Lvs- Val-Asp-Leu-Leu-Ser-Ala-Ile- 
96 

Lvs-Ser-Pro-Cvs-Glu-Arg-Glu-Thr-Pro-Glu-Glv- Ala-Glu -Ala-Lvs-Pro-Trp-Tvr-Glu- 
115 

Pro-lle-Tvr-Leu-Glv-GIv-Val-Phe-Glu-Leu-Glu-Lvs-Glv-Asp-Arg-Leu-Ser-Ala-Glu- 

134 

He- Asp- Arg- Pro- Asp-Tvr-Leu- Asp-Phe- A  la-Glu-Ser-Glv-Glu- Val-Tvr-Phe-Glv-lle- 
153 
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30  nmoles  of  TNF  was  hydrolyzed,  derivatized,  and  analyzed  as  described  above  for 
glucagon.  In  this  exploratory  experiment,  70  of  the  155  amino  acid  residues  were 
contained  in  the  polypeptides  identified  (underlined  areas  of  TNF  primary  struc¬ 
ture). 


STUDIES  PLANNED  FOR  YEAR  2  (11/1/87-10/31/88) 


Goal  I:  To  synthesize  and  purify  oligopeptides  of  the  form  Ala-X-Ala-Ala  and 
Ala-Ala-X-Ala-Ala  where  X  is  one  of  the  twenty  amino  acids 

Goal  2:  To  prepare  the  polvamino  alcohol  derivatives  of  these  oligopeptides 
and  study  their  GO  and  MS  properties 

Goal  3:  To  integrate  the  information  obtained  in  Goal  2  into  PEFALG 

Goal  4:  To  determine  a  sequence  of  enzymatic  reactions  for  preparing  a  more 
representative  mixture  of  oligopeptides  from  TNF 

Goal  5:  To  extend  this  technique  to  other  polypeptides  similar  in  size  to  TNF 
(human  growth  hormone,  IL-1) 

Goal  6:  To  initiate  the  study  of  the  conformation  of  TNF  in  solution  by  high 
resolution  NMR  and  molecular  modeling  and  parallel  structure/function 
studies 
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